upcScavenger » Computer Memory » Wiki: Memory Timings

Memory timings

( Computer Memory )

Additional timings
Relationship to bandwidt..
Handling in BIOS
See also
References
External reading

C O N T E N T S

Clock Latency Memory Timings

Rank: 100%

Wiki
Comments
Media

Memory timings or RAM timings describe the timing information of a memory module or the onboard LPDDRx. Due to the inherent qualities of VLSI and microelectronics, memory chips require time to fully execute commands. Executing commands too quickly will result in data corruption and results in system instability. With appropriate time between commands, memory modules/chips can be given the opportunity to fully switch transistors, charge capacitors and correctly signal back information to the memory controller. Because system performance depends on how fast memory can be used, this timing directly affects the performance of the system.

The timing of modern synchronous dynamic random-access memory (SDRAM) is commonly indicated using four parameters: CL, T_RCD, T_RP, and T_RAS in units of ; they are commonly written as four numbers separated with hyphens, e.g. 7-8-8-24. Variations include:

The fourth (t_RAS) is often omitted.
Or a fifth, the Command rate, is sometimes added (normally 2T or 1T, also written 2N, 1N or CR2).

These parameters (as part of a larger whole) specify the clock latency of certain specific commands issued to a random access memory. Lower numbers imply a shorter wait between commands (as determined in clock cycles). The Intel systems also have Gear 2 (Gear type 0) and Gear 4 (Gear type 1).

CAS latency	CL	The number of cycles between sending a column address to the memory and the beginning of the data in response. This is the number of cycles it takes to read the first bit of memory from a DRAM with the correct row already open. Unlike the other numbers, this is not a minimum, but an exact number that must be agreed on between the memory controller and the memory.
Row Address to Column Address Delay	T_RCD	The minimum number of clock cycles required between opening a row of memory and accessing columns within it. The time to read the first bit of memory from a DRAM without an active row is T_RCD + CL. Some memory controllers break this down into two values, T_RCDwr (write) and T_RCDrd (read).
Row Precharge Time	T_RP	The minimum number of clock cycles required between issuing the precharge command and opening the next row. The time to read the first bit of memory from a DRAM with the wrong row open is T_RP + T_RCD + CL.
Row Active Time	T_RAS	The minimum number of clock cycles required between a row active command and issuing the precharge command. This is the time needed to internally refresh the row, and overlaps with T_RCD. In SDRAM modules, it is simply T_RCD + CL. Otherwise, approximately equal to T_RCD + 2×CL.
Notes: RAS : Row Address Strobe, a terminology holdover from asynchronous DRAM. CAS : Column Address Strobe, a terminology holdover from asynchronous DRAM. T_WR : Write Recovery Time, the time that must elapse between the last write command to a row and precharging it. Generally, T_RAS ≈ T_RCD + T_WR. T_RC : Row Cycle Time. T_RC ≥ T_RAS + T_RP.

What determines absolute latency (and thus system performance) is determined by both the timings and the memory clock frequency. When translating memory timings into actual latency, timings are in units of clock cycles, which for double data rate memory is half the speed of the commonly quoted transfer rate. Without knowing the clock frequency it is impossible to state if one set of timings is "faster" than another.

For example, DDR3-2000 memory has a 1000 MHz clock frequency, which yields a 1 ns clock cycle. With this 1 ns clock, a CAS latency of 7 gives an absolute CAS latency of 7 ns. Faster DDR3-2666 memory (with a 1333 MHz clock, or 0.75 ns exactly; the 1333 is rounded) may have a larger CAS latency of 9, but at a clock frequency of 1333 MHz the amount of time to wait 9 clock cycles is only 6.75 ns. It is for this reason that DDR3-2666 CL9 has a smaller absolute CAS latency than DDR3-2000 CL7 memory.

Additional timings

For DDR3 and newer (DDR4, DDR5), the four timings described earlier are not the only relevant timings and give a very short overview of the performance of memory. The full, manufacturer-verified timings of a memory module are stored inside of a module's Serial Presence Detect (SPD) chip. On DDR3 and DDR4 DIMM modules, this chip is a Programmable ROM or EEPROM flash memory chip and contains the JEDEC-standardized timing table data format. See the SPD article for the table layout among different versions of DDR and examples of other memory timing information that is present on these chips.

On some DIMM modules, there are also "verified" overclocking profiles in the SPD using XMP or Expo formats. They indicate faster timing information (and higher voltages) that the manufacturer has verified to work with the module: higher voltages tend to reduce memory latency (time-to-stabilize) at a cost of more heat production. The BIOS on a PC may allow the user to choose among JEDEC profiles, XMP/Expo profiles, or define their own timing adjustments in an effort to increase performance (with possible risk of decreased stability).

On Alder Lake CPUs and later, tRCD and tRP are no longer linked, while before Intel did not allow to set them to different values.

DDR4 introduced support for FGR (fine granular refresh), with its own tRFC2 and tRFC4 timings, while DDR5 retained only tRFC2.

Relationship to bandwidth

Memory bandwidth measures the throughput of memory, and is generally limited by the transfer rate, not latency. By interleaving access to SDRAM's multiple internal banks, it is possible to transfer data continuously at the peak transfer rate. It is possible for increased bandwidth to come at a cost in latency. In particular, each successive generation of DDR SDRAM has higher transfer rates but the absolute latency does not change significantly, and especially when first appearing on the market, the new generation generally has longer latency than the previous one. The architecture and bugs in the CPUs can also change the latency.

Increasing memory bandwidth, even while increasing memory latency, may improve the performance of a computer system with multiple processors and/or multiple execution threads. Higher bandwidth will also boost performance of integrated graphics processors that have no dedicated video memory but use regular RAM as VRAM. Modern x86 processors are heavily optimized with techniques such as superscalar instruction pipelines, out-of-order execution, memory prefetching, memory dependence prediction, and branch prediction to preemptively load memory from RAM (and other caches) to speed up execution even further. With this amount of complexity from performance optimization, it is difficult to state with certainty the effects memory timings may have on performance. Different workloads have different memory access patterns and are affected differently in performance by these memory timings.

Handling in BIOS

In Intel systems, memory timings and management are handled by the Memory Reference Code (MRC), a part of the BIOS, which adjusts the operation of the CPU's integrated memory controller. A lot of it is also managed in Intel MEI, a Minix OS that runs on a dedicated core in PCH. Some of its sub-firmwares can have effect on memory latency.

The analogous part of the BIOS in modern AMD systems is the AGESA. The memory controller is integrated into the CPU/APU as part of the Infinity Fabric on-chip interconnect.

Memory timings

( Computer Memory )

Account

Navigation

Statistics

Memory timings ( Computer Memory )

Account

Navigation

Statistics

Memory timings

( Computer Memory )